Creating hidden Markov models for fast speech by optimized clustering

نویسندگان

  • Robert Faltlhauser
  • Thilo Pfau
  • Günther Ruske
چکیده

Previous studies have shown that the recognition accu racy often severely degrades at higher speech rates which can basically be traced back to two main dimensions acoustic and phonemic Reasons for this e ect can be found in the phonemic eld e g elisions as well as on the acoustic level with increasing rates of speech the spec tral characteristics are changing A main obstacle in this context is the training data consisting of only a small fraction of samples which can be labeled as fast There fore the e ects caused by an increased speech rate often cannot be completely covered To meet this problem in this paper an optimized clustering process is presented making e cient use of the available data Our modi ed mixture splitting algorithm with an incorporated cross validation step aims at increasing the generalization of Hidden Markov Models especially with respect to fast speech Experimental results showed a relative decrease in word error rate of for fast speech

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Syllable-length path mixture hidden Markov models with trajectory clustering for continuous speech recognition

Recent research suggests that modeling coarticulation in speech is more appropriate at the syllable level. However, due to a number of additional factors that can affect the way syllables are articulated, creating multiple acoustic models per syllable might be necessary. Our previous research on longer-length multi-path models has proved that data-driven trajectory clustering to be an attractiv...

متن کامل

Phone set selection for HMM-based dialect speech synthesis

This paper describes a method for selecting an appropriate phone set in dialect speech synthesis for a so far undescribed dialect by applying hidden Markov model (HMM) based training and clustering methods. In this pilot study we show how a phone set derived from the phonetic surface can be optimized given a small amount of dialect speech training data.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999